Scene segment playing system, method and recording medium thereof

ABSTRACT

A scene segment playing system, a scene segment playing method and a recording medium thereof are provided. The system includes media supply equipment, a description generating server, a scene server, and an end device. The media supply equipment is used for supplying media data. Upon receiving the media data, the description generating server supplies scene description data corresponding to the media data. The scene server acquires the media data and interval information supplied by the end device, retrieves scene segment data from the media data according to a comparison result of the interval information and the scene description data, and outputs the scene segment data to the end device for playing.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims the benefit of Taiwan Patent Application No. 101138298, filed on Oct. 17, 2012, which is hereby incorporated by reference for all purposes as if fully set forth herein.

BACKGROUND OF THE INVENTION

1. Field of Invention

The present invention relates to a scene segment playing system, a scene segment playing method and a recording medium thereof, and more particularly to a scene segment playing system, a scene segment playing method and a recording medium thereof in which a demanded scene segment is retrieved through scene description data.

2. Related Art

In the prior art, media data is usually played in a linear manner. Image playing software supplies a corresponding timeline for playing the media data. A user can click a position on the timeline or drag a slider on the timeline, so as to determine an image playing segment.

However, if a user is not familiar with the played content of the media data and the timepoint of playing, it takes a long time for the user to find the demanded video scene. Secondly, the precision of dragging a slider depends on the length of a timeline, and the precision of dragging a slider also affects the precision of positioning the timepoint of a demanded image. The longer the timeline is, the higher the precision of dragging a slider is. Therefore, if a user intends to acquire a targeted image or voice from the media data, he/she needs to manually control the timeline to position the slider at the timepoint of a demanded image, so as to play the demanded image.

SUMMARY OF THE INVENTION

To solve the above problems, the present invention discloses a scene segment playing system, a scene segment playing method and a recording medium thereof, in which media is selected according to scene description data to supply and play a demanded scene segment of a user.

The scene segment playing system disclosed in the present invention includes media supply equipment, a description generating server, a scene server and a first end device.

The media supply equipment is used for supplying media data. The first end device is used for inputting interval information. The description generating server is used for receiving the media data to supply scene description data corresponding to the media data. The scene server is used for acquiring the interval information, retrieving scene segment data from the media data according to a comparison result of comparing the interval information and the scene description data, and outputting the scene segment data to the first end device for playing.

The scene segment playing method disclosed in the present invention includes: supplying, by media supply equipment, media data; receiving, by a description generating server, the media data, and supplying scene description data corresponding to the media data; acquiring, by a scene server, interval information supplied by a first end device; retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and the scene description data; and outputting, by the scene server, the scene segment data to the first end device for playing.

In addition, the present invention also discloses a recording medium, which stores a program code readable by an electronic device. When reading the program code, the electronic device executes a scene segment playing method. The method is as described above.

The feature of the present invention lies in that, by means of comparison between the scene description data and the interval information, the demanded scene segment data of the user can be supplied in a targeted manner, and the user can find the demanded video scene within a relatively short time. Secondly, the user can acquire the demanded scene segment data without spending much time in controlling the timeline of the media data, so that the precision of supplying scene video can be maintained, the complexity of image control can be reduced, and the operational trouble that it is difficult for the user to drag a slider to a demanded point is avoided. Thirdly, by means of targeted retrieval of segment data, the user can acquire demanded scene segments all at once to form self-generated media, and proper media abstract information can also be further generated for media data, so that not only is a customized media operation that satisfies the demand of the user formed, but the operational complexity for the user is also reduced. Fourthly, by means of supplying the scene segment data and comparison between the scene description data and the interval information, the user can make personalized choices to watch demanded segments, so that high flexibility in media operations is achieved.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will become more fully understood from the detailed description given herein below for illustration only, which thus is not limitative of the present invention, and wherein:

FIG. 1 is a schematic structural view of a scene segment playing system according to an embodiment of the present invention;

FIG. 2 is a schematic view of data flows of self-generated description data according to an embodiment of the present invention;

FIG. 3 is a schematic view of modified data flows of scene segment data according to an embodiment of the present invention;

FIG. 4 is a schematic flow chart of a scene segment playing method according to an embodiment of the present invention;

FIG. 5 to FIG. 8 are detailed schematic flow charts of a scene segment playing method according to an embodiment of the present invention;

FIG. 9 is a schematic flow chart of generating a media playing tree structure according to an embodiment of the present invention;

FIG. 10 is a schematic flow chart of a method for self-generating description data according to an embodiment of the present invention;

FIG. 11 is a schematic flow chart of a method for modifying scene segment data according to an embodiment of the present invention;

FIG. 12 is a schematic view of scene description information according to an embodiment of the present invention;

FIG. 13 is a schematic view of media levels according to an embodiment of the present invention;

FIG. 14 is a view of playing scene segment data according to an embodiment of the present invention; and

FIG. 15 is a schematic view of a media playing tree structure according to an embodiment of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings.

FIG. 1 is a schematic structural view of a scene segment playing system according to an embodiment of the present invention. Referring to FIG. 1, the system is applicable to a device, equipment or system with a media playing capability, and the configuration form is not limited. The scene segment playing system includes a client side and a server side, and equipment and devices at the two sides are connected via a network. The server side includes media supply equipment 10, a description generating server 20 and a scene server 30. The client side includes one or more end devices, each of which is an electronic device of a user, for example, an electronic device having a human-machine interface for being operated by a user and having a network connection capability, such as a personal computer (PC), a notebook computer, a tablet PC, a smart phone, or a set-top box (STB). In this example, a first end device 40 is used for illustration.

The media supply equipment 10 is used for supplying media data 11. The media data 11 may be a complete image, voice or video data, or stream data transmitted in real time. The media supply equipment 10 may be equipment at the same place as the description generating server 20 and the scene server 30, or third-party equipment at a different place, which is not limited. The modes in which the media supply equipment 10 outputs the media data 11 include wired and wireless data transmission modes such as broadcast, broadband, wired transmission (for example, Community Antenna Television or Cable Television, CATV), and network protocol transmission (IPTV), which are also not limited.

The media supply equipment 10 includes hardware having a media supply capability, or a combination of one or more of a unit, component, device, equipment and system in which software and hardware are combined. The media data 11 includes multiple same or different scene segments. For example, when the media data 11 is image data, the image data refers to image segments having more than one of contents such as targets, scenes and characters. As another example, when the media data 11 is voice data, the voice data refers to voice segments having more than one of contents such as high pitch, low pitch, speeches and music.

When acquiring the media data 11, the description generating server 20 supplies scene description data 21 corresponding to the media data 11. The scene description data 21 is annotation data for annotating the playing content of the media data 11, for example, data that further describes a scene segment, such as a brief introduction of the scene playing content of the media data 11, the scene playing time, and the scene title. The modes of supplying the scene description data 21 are as follows: (1) the description generating server 20 directly generates the scene description data 21 according to the scene playing content of the media data 11; and (2) the description generating server 20 acquires the scene description data 21 corresponding to the media data 11 from an external device.

The first end device 40 includes a data input interface for a user to input data, and also has a media playing capability. The presentation end of the interface of the first end device 40 depends on the demands of designers, and is not limited. A user inputs interval information 41 by using the first end device 40.

The interval information 41 refers to a request condition input by a user when the user intends to search for a specific scene segment from the media data 11. For example, when the media data 11 is recorded images of a basketball game, a user inputs a request condition such as scoring pictures of his favorite players, scoring pictures of three-point shots of all players in the game, or scoring pictures of three-point shots of his favorite players. Further, for example, when the media data 11 is music data of an opera, a user inputs a request condition such as solos of a female protagonist of the opera or absolute music segment performance. Even further, when a user already knows the content of the media data 11, the user can also input more than one time interval as the reference for searching for a scene segment.

The scene server 30 acquires the interval information 41 and the media data 11; the source from which the media data 11 is acquired is the description generating server 20 or the media supply equipment 10. The scene server 30 compares the interval information 41 and the scene description data 21, retrieves scene segment data 31 from the media data 11 according to a comparison result, and outputs the scene segment data 31 to the first end device 40 for playing.

The types of request condition included in the interval information 41 are illustrated as follows:

(1) The interval information 41 includes a start time and an end time. The scene server 30 compares the interval information 41 and the scene description data 21, acquires more than one scene playing time between the start time and the end time from the scene description data 21, and retrieves more than one piece of target segment data 32 from the media data 11 according to the scene playing time, so as to form the scene segment data 31. The target segment data 32 may be retrieved from the media data 11 according to the intervals of the scene segments, or may be retrieved by directly cutting the media data 11 without considering the intervals of the scene segments in the media data 11.

(2) The interval information 41 includes content designationinformation. When comparing the interval information 41 and the scenedescription data 21, the scene server 30 acquires a scene playingcontent that meets the content designation information from the scenedescription data 21, and retrieves more than one piece of target segmentdata 32 from the media data 11 according to the scene playing content,so as to form the scene segment data 31.

(3) The scene server 30 first divides the media data 11 into more than one piece of segment data 33 according to record information (the playing content, playing time, or other correlated information) of the scene description data 21. The interval information 41 includes content designation data. When comparing the content designation data and the record information of the scene description data 21, the scene server 30 retrieves more than one piece of target segment data 32 from all the segment data 33 to form the scene segment data 31.

(4) The scene server 30 divides the media data 11 into a plurality of pieces of segment data 33 according to record information of the scene description data 21. The interval information 41 includes timepoint data. The scene server 30 compares the timepoint data with a playing start time and a playing end time of each piece of segment data 33 to retrieve at least one piece of target segment data 32 from the pieces of segment data 33, where the scene playing time of the target segment data 32 includes the timepoint data. The scene server 30 forms the scene segment data 31 with the retrieved target segment data 32.

Also, the interval information 41 may include more than one type of request condition, and may also include multiple request conditions for different retrieval demands. The scene server 30 may form one or more pieces of scene segment data 31 from the media data 11 according to each request condition.
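The comparison between the interval information 41 and the scene description data 21 for request condition types (1), (2)/(3) and (4) above can be sketched as follows. This is only a minimal illustration under stated assumptions: the record layout `SceneRecord`, the function names, and the use of a case-insensitive keyword match for content designation are hypothetical and are not specified by the embodiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SceneRecord:
    """One entry of scene description data (hypothetical layout)."""
    start: float   # scene playing start time, in seconds into the media
    end: float     # scene playing end time, in seconds into the media
    content: str   # brief description of the scene playing content


def by_time_range(records, start, end):
    """Type (1): scenes whose playing time lies between start and end."""
    return [r for r in records if r.start >= start and r.end <= end]


def by_content(records, keyword):
    """Types (2)/(3): scenes whose described content meets the designation.

    A case-insensitive substring match is an assumption made for this sketch.
    """
    return [r for r in records if keyword.lower() in r.content.lower()]


def by_timepoint(records, timepoint):
    """Type (4): scenes whose playing time includes the given timepoint."""
    return [r for r in records if r.start <= timepoint <= r.end]


# Illustrative scene description data (invented values).
records = [
    SceneRecord(0.0, 12.0, "tip-off"),
    SceneRecord(12.0, 20.0, "three-point shot by player 2"),
    SceneRecord(20.0, 31.0, "free throw by player 7"),
    SceneRecord(31.0, 40.0, "three-point shot by player 9"),
]

three_pointers = by_content(records, "three-point")
```

Each function returns the matching records, whose start and end times may then be used to cut the target segment data 32 out of the media data 11.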

Upon receiving one or more pieces of scene segment data 31, the first end device 40 performs playing or presents a list. The user selects one or more pieces of scene segment data 31 for playing through a control interface of the first end device 40.

Moreover, one or more pieces of self-generated media may be generated from the scene segment data 31 according to rules such as the playing order and playing time of the media data 11 and the order of the request conditions. The rule is selected by the first end device 40 according to a playing program or setting values of software, or is input and set by the user through a control interface, which is not limited.

Furthermore, the scene server 30 divides the media data 11 into one or more pieces of segment data 33 according to the record information of the scene description data 21, and then generates a media playing tree structure from the segment data 33 according to the data dependency, data attribute and data level relationship of each piece of segment data 33. When acquiring the interval information 41, the scene server 30 retrieves relevant segment data 33 from the media playing tree structure according to the interval information 41 to form the scene segment data 31.

The interface of the first end device 40 may also present input fields of the media playing tree structure, so that the user only needs to input each request condition in the corresponding field according to the data dependence relationship of the request condition or the type of the data attribute, where each request condition is used as a reference for the scene server 30 to retrieve and categorize the segment data 33.

The media supply equipment 10 may also supply a plurality of pieces of media data 11, and the description generating server 20 supplies scene description data 21 corresponding to each piece of media data 11. When inputting the interval information 41 by using the first end device 40, the user may set a different request condition for each piece of media data 11 or set one request condition for all the media data 11, depending on the demands of the user.

The scene server 30 compares the relevant scene description data 21 and the interval information 41, and forms one or more pieces of segment data 33 from each piece of media data 11, so as to return the segment data 33 to the first end device 40. Then, self-generated media that meets demands of the user may be generated by the user from the acquired segment data 33 through the first end device 40.

The retrieval results such as the segment data 33, the scene description data 21, the media playing tree structure, and the playing media may be stored in the scene server 30 for use in the next supply operation of the scene segment data 31. Moreover, the scene segment data 31 generated through the scene segment supply operation may also be stored in the first end device 40 to be directly obtained and played by playing software/hardware of the first end device 40.

FIG. 2 is a schematic view of data flows of self-generated description data according to an embodiment of the present invention. Referring to FIG. 2, in this example, the first end device 40 also acquires the media data 11 supplied by the media supply equipment 10 and plays the media data 11. During the playing, the user may retrieve segments from the media data 11 during playing through the control interface of the first end device 40, so as to form more than one retrieved segment and establish first self-generated description data 42 corresponding to the retrieved segments. The first self-generated description data 42 includes a first segment playing time, which includes the playing start time and the playing end time of the retrieved segment (or segments) in the media data 11. The scene server 30 then retrieves at least one piece of target segment data 32 within the first segment playing time from the media data 11 according to the first self-generated description data 42, so as to form scene segment data 31 with the pieces of target segment data 32, and makes the first self-generated description data 42 correspond to the scene segment data 31. The formed scene segment data 31 may be supplied to other end devices.

FIG. 3 is a schematic view of modified data flows of scene segment data according to an embodiment of the present invention. The difference between FIG. 3 and FIG. 2 lies in that the system further includes a second end device 50. The first self-generated description data 42 further includes first segment content data 421, which includes the playing content of the scene segment data 31 corresponding to the first self-generated description data 42.

As with the operations of the first end device 40, the second end device 50 may also acquire the media data 11. Also, another user may retrieve segments from the media data 11 during playing through a control interface of the second end device 50, so as to form more than one retrieved segment and establish second self-generated description data 52 corresponding to the retrieved segments. The second self-generated description data 52 includes a second segment playing time and second segment content data 521. The second segment playing time includes the playing start time and the playing end time of the retrieved segment (or segments) in the media data 11. The second segment content data 521 is the playing content of the retrieved segment.

When the scene server 30 acquires the second self-generated description data 52, if the scene server 30 determines that the first segment content data 421 and the second segment content data 521 are the same or equivalent and the first segment playing time and the second segment playing time are different, the scene server 30 sets a playing modification time according to the first segment playing time and the second segment playing time, so as to retrieve a plurality of adjustment segments 34 whose playing time is within the playing modification time from the media data 11 and update the scene segment data 31.

The playing modification time is the union or intersection of the time lengths of the first segment playing time and the second segment playing time. Alternatively, the start time of the playing modification time is the average value of the start time of the first segment playing time and the start time of the second segment playing time, and the end time of the playing modification time is the average value of the end time of the first segment playing time and the end time of the second segment playing time. Alternatively, other relevant or similar setting manners may be adopted, and the present invention is not limited to the above setting manners and depends on the demands of designers.
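The three setting manners named above (union, intersection, and averaging of start and end times) can be sketched as follows. This is a hedged illustration only: the function name `modification_time`, the `(start, end)` tuple representation in seconds, and the `mode` parameter are assumptions introduced for this sketch. The union is taken here as the single span from the earlier start to the later end, which is an assumption for the case where the two playing times do not overlap.

```python
def modification_time(first, second, mode="average"):
    """Combine two segment playing times into a playing modification time.

    `first` and `second` are (start, end) tuples in seconds.
    Returns a (start, end) tuple, or None when the intersection is empty.
    """
    s1, e1 = first
    s2, e2 = second
    if mode == "union":
        # Span from the earlier start to the later end (assumed reading).
        return (min(s1, s2), max(e1, e2))
    if mode == "intersection":
        start, end = max(s1, s2), min(e1, e2)
        return (start, end) if start < end else None
    if mode == "average":
        return ((s1 + s2) / 2, (e1 + e2) / 2)
    raise ValueError(f"unknown mode: {mode!r}")


# Two users marked the same scene with slightly different times.
first_time = (100.0, 130.0)
second_time = (110.0, 140.0)
```

With these example values, the union widens the segment to cover both selections, the intersection keeps only the part both users agree on, and the average splits the difference.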

FIG. 4 is a schematic flow chart of a scene segment playing method according to an embodiment of the present invention; and FIG. 5 to FIG. 8 are detailed schematic flow charts of a scene segment playing method according to an embodiment of the present invention. Please refer to FIG. 1 to FIG. 3 in combination for ease of understanding. The process of the method is as follows.

Media supply equipment 10 supplies media data 11 (Step S110). As described above, the media data 11 supplied by the media supply equipment 10 may be a complete image, voice or video data, or stream data transmitted in real time. The modes of transmitting the media data 11 include wired and wireless data transmission modes such as broadcast, broadband, wired transmission, and network protocol transmission.

A description generating server 20 receives the media data 11, and supplies scene description data 21 corresponding to the media data 11 (Step S120). As described above, the manners of supplying the scene description data 21 include: (1) the description generating server 20 directly generates the scene description data 21 according to the playing content of the media data 11; and (2) the scene description data 21 corresponding to the media data 11 is acquired from an external device.

A scene server 30 acquires interval information 41 supplied by a first end device 40 (Step S130). The interval information 41 refers to a request condition input by a user when the user intends to search for a specific media segment from the media data 11. The request condition is at least one of the content designation, the playing start time and playing end time, and timepoint designation.

The scene server 30 retrieves scene segment data 31 from the media data 11 according to a comparison result of comparing the interval information 41 and the scene description data 21 (Step S140). In this step, according to different request conditions included in the interval information 41, the scene server 30 may also retrieve the scene segment data 31 in different modes. The illustration is as follows:

(1) As shown in FIG. 1 and FIG. 5, the interval information 41 includes a start time and an end time. In this step, when comparing the interval information 41 and each piece of scene description data 21, the scene server 30 acquires more than one scene playing time between the start time and the end time from the scene description data 21 (Step S141). Subsequently, the scene server 30 retrieves more than one piece of target segment data 32 from the media data 11 according to the scene playing time, so as to form the scene segment data 31 (Step S148).

(2) As shown in FIG. 1 and FIG. 6, the interval information 41 includes content designation information. In this step, when comparing the interval information 41 and the scene description data 21, the scene server 30 acquires a scene playing content that meets the content designation information from the scene description data 21 (Step S142). Subsequently, the scene server 30 retrieves more than one piece of target segment data 32 from the media data 11 according to the scene playing content, so as to form the scene segment data 31 (Step S148).

(3) As shown in FIG. 1 and FIG. 7, the scene server 30 divides the media data 11 into more than one piece of segment data 33 according to record information of the scene description data 21. The interval information 41 includes content designation data. In this step, the scene server 30 compares the content designation data and the record information of the scene description data 21 (Step S143), and retrieves more than one piece of target segment data 32 from all the segment data 33, so as to form the scene segment data 31 (Step S149).

(4) As shown in FIG. 1 and FIG. 8, the scene server 30 divides the media data 11 into a plurality of pieces of segment data 33 according to record information of each piece of scene description data 21. The interval information 41 includes timepoint data. In this step, the scene server 30 compares the timepoint data with a playing start time and a playing end time of each piece of segment data 33 (Step S144), and retrieves at least one piece of target segment data 32 from all the segment data 33, so as to form the scene segment data 31 (Step S149).

Subsequently, the scene server 30 outputs the scene segment data 31 to the first end device 40 (Step S150). Upon receiving one or more pieces of scene segment data 31, the first end device 40 plays the scene segment data 31 or presents a list. A user then selects one or more pieces of scene segment data 31 for playing through the control interface of the first end device 40.

FIG. 9 is a schematic flow chart of generating a media playing tree structure according to an embodiment of the present invention. Please refer to FIG. 1 in combination for ease of understanding. The method includes the following steps.

A scene server 30 divides the media data 11 into a plurality of pieces of segment data 33 according to record information of scene description data 21 (Step S210).

The scene server 30 generates a media playing tree structure from the segment data 33 according to the data dependency, data attribute and data level relationship of each piece of segment data 33 (Step S220).

In Step S140, when acquiring the interval information 41, the scene server 30 retrieves the segment data 33 from the media playing tree structure according to the interval information 41, so as to form the scene segment data 31.

FIG. 10 is a schematic flow chart of a method for self-generating scene description data according to an embodiment of the present invention. Please refer to FIG. 2 in combination for ease of understanding. The method includes the following steps.

A first end device 40 establishes first self-generated description data 42 and transmits the first self-generated description data 42 to a scene server 30 (Step S310). The first self-generated description data 42 includes a first segment playing time of more than one retrieved segment in the media data 11, which includes the playing start time and the playing end time of the retrieved segment (or segments).

The scene server 30 retrieves at least one piece of target segment data 32 within the first segment playing time from the media data 11 according to the first self-generated description data 42 (Step S320).

The scene server 30 forms scene segment data 31 with the target segment data 32, and makes the first self-generated description data 42 correspond to the scene segment data 31 (Step S330).

FIG. 11 is a schematic flow chart of a method for modifying scene segment data according to an embodiment of the present invention. Please refer to FIG. 3 and FIG. 10 in combination for ease of understanding. The method includes the following steps.

A second end device 50 establishes second self-generated description data 52, which includes a second segment playing time and second segment content data 521 (Step S410). The second segment playing time includes the playing start time and the playing end time of the retrieved segment (or segments) in the media data 11. The second segment content data 521 is the playing content of the retrieved segment.

When acquiring the second self-generated description data 52, the scene server 30 determines whether the first segment content data 421 and the second segment content data 521 are the same or equivalent and whether the first segment playing time and the second segment playing time are different (Step S420).

The scene server 30 decides, according to a determination result, whether to set a playing modification time according to the first segment playing time and the second segment playing time, retrieve a plurality of adjustment segments 34 whose scene playing time is within the playing modification time from the media data 11, and update the scene segment data 31 (Step S430). The manner of setting the playing modification time is as described above and is not repeated herein.

FIG. 12 to FIG. 15 are schematic views of situations of media control according to an embodiment of the present invention. Herein, the media data 11 is described as recorded images of a basketball game.

FIG. 12 is a schematic view of scene description information according to an embodiment of the present invention, in which scene descriptions and the corresponding scene times of images of a basketball game are presented.

When a user merely wants to watch “scoring pictures in the third quarter” and knows the playing timepoints of the “scoring pictures in the third quarter” in the media data 11, these playing timepoints may be directly set in the interval information 41, for example, the third-quarter countdown times “11:39”, “10:50”, “10:49”, “09:39”, “09:16”, “08:58”, “08:44”, “08:29”, “08:07”, “07:47”, “07:35”, or the relevant timepoints. The scene server 30 retrieves the corresponding scene segment data 31 from the time before the timepoints, the time after the timepoints, or a window centered on the timepoints, to be played by the relevant end device.

Alternatively, when the demand condition set by the user is “scoring pictures of three-point shots in the third quarter”, the scene server 30 takes the third-quarter countdown times “11:39”, “09:16”, “08:58”, “07:47” and so on as the timepoints corresponding to the “scoring scenes of three-point shots in the third quarter”. The scene server 30 retrieves the corresponding scene segment data 31 from the time before the timepoints, the time after the timepoints, or a window centered on the timepoints, to be played by the relevant end device.
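Retrieval around a timepoint, using the time before it, the time after it, or the timepoint as the center, might be sketched as below. The helper names `parse_clock` and `window_around`, the window length, and the clamping to the media duration are all assumptions for illustration. Mapping a game-clock countdown such as “11:39” onto a position in the media data 11 would additionally require alignment information that the embodiment does not specify, so the sketch only parses the clock string and leaves that mapping out.

```python
def parse_clock(text):
    """Parse a "MM:SS" clock string into a number of seconds."""
    minutes, seconds = text.split(":")
    return int(minutes) * 60 + int(seconds)


def window_around(timepoint, length, mode="center", media_duration=None):
    """Return a (start, end) retrieval window, in seconds, for a timepoint.

    mode "before": the `length` seconds leading up to the timepoint
    mode "after":  the `length` seconds following the timepoint
    mode "center": `length` seconds centered on the timepoint
    """
    if mode == "before":
        start, end = timepoint - length, timepoint
    elif mode == "after":
        start, end = timepoint, timepoint + length
    elif mode == "center":
        start, end = timepoint - length / 2, timepoint + length / 2
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    start = max(start, 0)                 # do not run past the media start
    if media_duration is not None:
        end = min(end, media_duration)    # do not run past the media end
    return (start, end)
```

For example, `window_around(100, 10, "center")` yields a ten-second window spanning five seconds on either side of the timepoint.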

FIG. 13 is a schematic view of media levels according to an embodiment of the present invention. The recorded images of the basketball game can be divided into different image levels. The highest level refers to images of the whole game, the next level refers to images of all quarters, and the next level refers to images of close-up shots. The whole images are formed of multiple pieces of segment data 33, and correspond to the scene description data 21. Each level can be regarded as the basis for a media division mode.

FIG. 14 is a view of playing scene segment data according to an embodiment of the present invention. With the media division mode of the third level as an example, the selected target segment data 32 includes segments P1, P3 and P5. The relevant playing software, program or module jumps to the starting point of the segment P3 and starts to play the segment P3 after the segment P1 is played. In the same way, after playing the segment P3, the playing software, program or module jumps to the starting point of the segment P5 and starts to play the segment P5. During media forwarding, the relevant playing software also performs the forwarding operation segment by segment with the segment as a unit, or directly designates segments for performing the forwarding, for example, forwarding from the segment P1 to the segment P5. On the other hand, during an image rewinding operation, rewinding from the segment P5 to the segment P3 is performed, and then rewinding from the segment P3 to the segment P1 is performed. Alternatively, the relevant playing software directly designates segments for rewinding, for example, from the segment P5 to the segment P1 or the starting point of the film.
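The segment-by-segment playing, forwarding and rewinding described above could be modeled roughly as follows. The class `SegmentPlayer`, its method names, and the `(label, start, end)` tuples with invented times are hypothetical; the sketch only tracks which selected segment is current and does not perform any actual media decoding.

```python
class SegmentPlayer:
    """Tracks the current position within a list of selected segments."""

    def __init__(self, segments):
        # Each segment is (label, start_seconds, end_seconds); keep them in
        # media order so forwarding and rewinding follow the timeline.
        self.segments = sorted(segments, key=lambda seg: seg[1])
        self.index = 0

    def current(self):
        return self.segments[self.index]

    def forward(self, steps=1):
        """Jump ahead by whole segments, stopping at the last one."""
        self.index = min(self.index + steps, len(self.segments) - 1)
        return self.current()

    def rewind(self, steps=1):
        """Jump back by whole segments, stopping at the first one."""
        self.index = max(self.index - steps, 0)
        return self.current()

    def jump_to(self, label):
        """Directly designate a segment by its label."""
        for i, seg in enumerate(self.segments):
            if seg[0] == label:
                self.index = i
                return seg
        raise KeyError(label)


# Selected segments P1, P3 and P5 (times are illustrative only).
player = SegmentPlayer([("P3", 40, 55), ("P1", 0, 15), ("P5", 90, 110)])
```

Playing then amounts to seeking to the start time of `player.current()`, playing until its end time, and calling `forward()`; designating a segment directly corresponds to `jump_to("P5")`.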

FIG. 15 is a schematic view of a media playing tree structure according to an embodiment of the present invention. Herein, by combining the image levels shown in FIG. 13 and the data dependence relationships or the types of data attributes of the segment data 33 recorded in the scene description data 21, a media playing tree structure may be generated from the segment data 33 of the whole recorded images.

The first level of the media playing tree structure is the whole game image. The second level is branches of the first level images, which are images of both teams. The third level is branches of the second level images, which are close-up images of the two teams in the game. The fourth level is branches of the third level images, which are the close-up shot images of specific players from the two teams in the game.

After the user sets the interval information 41, the scene server 30 retrieves the demanded target segment data 32 from the media data 11 through the media playing tree structure according to the demand condition included in the interval information 41, so as to form the above self-generated image to be played by a relevant playing module.

However, when retrieving the segment data 33, the scene server 30 does not need to be limited by the above image levels. For example, when the user wants to watch “all scoring pictures of No. 2 player of Team A” and then watch “whole images of the fourth quarter”, the scene server 30 retrieves segment data corresponding to “all scoring pictures of No. 2 player of Team A” according to the fourth level structure of the media playing tree structure, retrieves the segment data corresponding to the “whole images of the fourth quarter” according to the second level structure of the media playing tree structure, and then, according to the media retrieval manner, forms the demanded self-generated images of the user to be played by the relevant playing module. That is, the scene server 30 may retrieve the segment data of the same level, different levels or partially the same and partially different levels from the media playing tree structure, and integrate the segment data into scene segments to be played by the end device.
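Retrieval across different levels of a media playing tree structure can be sketched as follows. The sketch is illustrative only: the `Node` class, the attribute keys (`kind`, `team`, `player`, `quarter`), the helper names `walk`, `has` and `retrieve`, and all times are hypothetical, and the toy tree is simplified rather than a reproduction of FIG. 15.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterator, List, Tuple


@dataclass
class Node:
    label: str
    start: float  # seconds of media time
    end: float
    attrs: Dict[str, str] = field(default_factory=dict)
    children: List["Node"] = field(default_factory=list)


def walk(node: Node, level: int = 1) -> Iterator[Tuple[int, Node]]:
    """Yield (level, node) for every node, depth first; the root is level 1."""
    yield level, node
    for child in node.children:
        yield from walk(child, level + 1)


def has(**wanted: str) -> Callable[[Node], bool]:
    """Build a demand condition that matches nodes carrying all given attributes."""
    return lambda n: all(n.attrs.get(k) == v for k, v in wanted.items())


def retrieve(
    root: Node, conditions: List[Callable[[Node], bool]]
) -> List[Tuple[int, float, float]]:
    """Collect (level, start, end) for each condition, in the order requested."""
    playlist = []
    for cond in conditions:
        matches = [(lvl, n) for lvl, n in walk(root) if cond(n)]
        matches.sort(key=lambda pair: pair[1].start)
        playlist.extend((lvl, n.start, n.end) for lvl, n in matches)
    return playlist


score = {"kind": "score", "team": "A", "player": "2"}
root = Node("whole game", 0.0, 2880.0, children=[
    Node("fourth quarter", 2160.0, 2880.0, {"kind": "quarter", "quarter": "4"}),
    Node("Team A", 0.0, 2880.0, {"kind": "team", "team": "A"}, children=[
        Node("Team A close-ups", 0.0, 2880.0, {"kind": "close-up", "team": "A"}, children=[
            Node("No. 2 scores", 300.0, 310.0, dict(score)),
            Node("No. 2 scores", 2500.0, 2512.0, dict(score)),
        ]),
    ]),
])

# First all scoring pictures of No. 2 player of Team A, then the whole fourth quarter.
demand = [has(kind="score", team="A", player="2"), has(kind="quarter", quarter="4")]
print(retrieve(root, demand))
# [(4, 300.0, 310.0), (4, 2500.0, 2512.0), (2, 2160.0, 2880.0)]
```

The resulting list mixes level-four and level-two segments in the order the user asked for them, which corresponds to integrating segment data of different levels into one sequence of scene segments.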

The invention being thus described, it will be obvious that the same may be varied in many ways. Such variations are not to be regarded as a departure from the spirit and scope of the invention, and all such modifications as would be obvious to one skilled in the art are intended to be included within the scope of the following claims.

1. A scene segment playing system, comprising: a media supply equipment, used for supplying media data; a description generating server, used for receiving the media data, so as to supply scene description data corresponding to the media data; a scene server, used for acquiring interval information and the media data, and according to a comparison result of comparing the interval information and the scene description data, retrieving scene segment data from the media data, wherein the scene server divides the media data into a plurality of pieces of segment data according to record information of the scene description data, and generates a media playing hierarchical structure for each piece of segment data according to the data dependency, data attribute and data level relationship of the segment data, wherein when acquiring the interval information, the scene server retrieves at least one piece of target segment data from the media playing hierarchical structure according to the interval information to form the scene segment data; and a first end device, used for inputting the interval information, and acquiring and playing the scene segment data.
 2. The scene segment playing system according to claim 1, wherein the interval information comprises a start time and an end time, and when comparing the interval information and the scene description data, the scene server acquires at least one scene playing time between the start time and the end time from the scene description data, and retrieves at least one piece of target segment data corresponding to the at least one scene playing time from the media data to form the scene segment data.
 3. The scene segment playing system according to claim 1, wherein the interval information comprises content designation information, and when comparing the interval information and the scene description data, the scene server acquires at least one scene playing content that meets the content designation information from the scene description data, and retrieves at least one piece of target segment data corresponding to the at least one scene playing content from the media data to form the scene segment data.
 4. The scene segment playing system according to claim 1, wherein the scene server divides the media data into a plurality of pieces of segment data according to record information of the scene description data, the interval information comprises content designation data, and the scene server compares the content designation data and the scene description data, and retrieves at least one piece of target segment data from the pieces of scene segment data to form the scene segment data.
 5. The scene segment playing system according to claim 1, wherein the scene server divides the media data into a plurality of pieces of segment data according to record information of the scene description data, the interval information comprises timepoint data, and the scene server compares the timepoint data and a playing start time and a playing end time of each piece of segment data to retrieve at least one piece of target segment data, so as to form the scene segment data.
 6. The scene segment playing system according to claim 1, wherein the first end device is used for establishing first self-generated description data and transmitting the first self-generated description data to the scene server, the first self-generated description data comprises a first segment playing time corresponding to the media data, and the scene server retrieves at least one piece of target segment data within the first segment playing time from the media data according to the first self-generated description data to form the scene segment data, and makes the first self-generated description data correspond to the scene segment data.
 7. The scene segment playing system according to claim 6, wherein the first self-generated description data comprises first segment content data, the system further comprises a second end device, used for establishing second self-generated description data, the second self-generated description data comprises a second segment playing time and second segment content data, and when acquiring the second self-generated description data, and determining that the first segment content data and the second segment content data are the same or equivalent and the first segment playing time and the second segment playing time are different, the scene server sets a playing modification time according to the first segment playing time and the second segment playing time, so as to retrieve a plurality of adjustment segments whose scene playing time is within the playing modification time from the media data and update the scene segment data.
 8. The scene segment playing system according to claim 7, wherein the playing modification time is the union or intersection of the time lengths of the first segment scene playing time and the second segment scene playing time.
 9. The scene segment playing system according to claim 7, wherein the start time of the playing modification time is the average value of the start time of the first segment playing time and the start time of the second segment playing time, and the end time of the playing modification time is the average value of the end time of the first segment playing time and the end time of the second segment playing time.
 10. (canceled)
 11. (canceled)
 12. A scene segment playing method, comprising: supplying, by media supply equipment, media data; receiving, by a description generating server, the media data, and supplying scene description data corresponding to the media data; acquiring, by a scene server, interval information supplied by a first end device; retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and the scene description data; dividing the media data into a plurality of pieces of segment data according to record information of the scene description data, and generating a media playing hierarchical structure for each piece of segment data according to the data dependency, data attribute and data level relationship of the segment data by the scene server; retrieving at least one piece of target segment data from the media playing hierarchical structure according to the interval information to form the scene segment data by the scene server, when the scene server acquires the interval information; and outputting, by the scene server, the scene segment data to the first end device for playing.
 13. The scene segment playing method according to claim 11, wherein the interval information comprises a start time and an end time, and the step of retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and each piece of scene description data comprises: when comparing the interval information and the scene description data, acquiring, by the scene server, at least one scene playing time between the start time and the end time in the scene description data; and retrieving, by the scene server, at least one piece of target segment data corresponding to the at least one scene playing time from the media data to form the scene segment data.
 14. The scene segment playing method according to claim 11, wherein the interval information comprises content designation information, and the step of retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and each piece of scene description data comprises: when comparing the interval information and the scene description data, acquiring, by the scene server, at least one scene playing content that meets the content designation information from the scene description data; and retrieving, by the scene server, at least one piece of target segment data corresponding to the at least one scene playing content to form the scene segment data.
 15. The scene segment playing method according to claim 11, wherein the scene server divides the media data into a plurality of pieces of segment data according to record information of each piece of scene description data, the interval information comprises content designation data, and the step of retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and each piece of scene description data comprises: comparing, by the scene server, the content designation data and the scene description data, and retrieving at least one piece of target segment data from the pieces of scene segment data to form the scene segment data.
 16. The scene segment playing method according to claim 11, wherein the scene server divides the media data into a plurality of pieces of segment data according to record information of each piece of scene description data, the interval information comprises timepoint data, and the step of retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and each piece of scene description data comprises: comparing, by the scene server, the timepoint data and a playing start time and a playing end time of each piece of scene segment data to retrieve at least one piece of target segment data, so as to form the scene segment data.
 17. The scene segment playing method according to claim 11, further comprising: establishing, by the first end device, first self-generated description data and transmitting the first self-generated description data to the scene server, wherein the first self-generated description data comprises a first segment playing time corresponding to the media data; retrieving, by the scene server, at least one piece of target segment data within the first segment playing time from the media data according to the first self-generated description data; and forming, by the scene server, the scene segment data with the at least one piece of target segment data, and making the first self-generated description data correspond to the scene segment data.
 18. The scene segment playing method according to claim 17, wherein the first self-generated description data comprises first segment content data, and the method further comprises: establishing, by a second end device, second self-generated description data, wherein the second self-generated description data comprises a second segment playing time and second segment content data; when acquiring the second self-generated description data, determining, by the scene server, whether the first segment content data and the second segment content data are the same or equivalent and whether the first segment playing time and the second segment playing time are different; and deciding, by the scene server, according to a determination result, whether to set a playing modification time according to the first segment playing time and the second segment playing time, so as to retrieve a plurality of adjustment segments whose scene playing time is within the playing modification time from the media data and update the scene segment data.
 19. The scene segment playing method according to claim 18, wherein the playing modification time is the union or intersection of the time lengths of the first segment playing time and the second segment playing time.
 20. The scene segment playing method according to claim 18, wherein the start time of the playing modification time is the average value of the start time of the first segment playing time and the start time of the second segment playing time, and the end time of the playing modification time is the average value of the end time of the first segment playing time and the end time of the second segment playing time.
 21. A non-transitory recording medium, storing a program code readable by an electronic device, and when reading the program code, the electronic device executes a scene segment playing method, and the method comprises the following steps: supplying, by media supply equipment, media data; receiving, by a description generating server, the media data, and supplying scene description data corresponding to the media data; acquiring, by a scene server, interval information supplied by a first end device; retrieving, by the scene server, scene segment data from the media data according to a comparison result of comparing the interval information and the scene description data; dividing the media data into a plurality of pieces of segment data according to record information of the scene description data, and generating a media playing hierarchical structure for each piece of segment data according to the data dependency, data attribute and data level relationship of the segment data by the scene server; retrieving at least one piece of target segment data from the media playing hierarchical structure according to the interval information to form the scene segment data by the scene server, when the scene server acquires the interval information; and outputting, by the scene server, the scene segment data to the first end device for playing.
 22. The non-transitory recording medium according to claim 21, wherein the method further comprises: establishing, by the first end device, first self-generated description data and transmitting the first self-generated description data to the scene server, wherein the first self-generated description data comprises a first segment playing time corresponding to the media data; retrieving, by the scene server, at least one piece of target segment data within the first segment playing time from the media data according to the first self-generated description data; and forming, by the scene server, the scene segment data with the at least one piece of target segment data, and making the first self-generated description data correspond to the scene segment data.
 23. The non-transitory recording medium according to claim 22, wherein the first self-generated description data comprises first segment content data, and the method further comprises: establishing, by a second end device, second self-generated description data, wherein the second self-generated description data comprises a second segment playing time and second segment content data; when acquiring the second self-generated description data, determining, by the scene server, whether the first segment content data and the second segment content data are the same or equivalent and whether the first segment playing time and the second segment playing time are different; and deciding, by the scene server, according to a determination result, whether to set a playing modification time according to the first segment playing time and the second segment playing time, so as to retrieve a plurality of adjustment segments whose scene playing time is within the playing modification time from the media data and update the scene segment data.